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ABSTRACT 

Three techniques for assessing the adequacy of 
correlation matrices for factor analysis were applied to four 
examples from the literature. The methods compared were: (1) 
inspection of the off diagonal elements of the anti-image covariance 
matrix S(to the 2nd) R(to the -1) and S(to the 2nd); (2) the Measure 
of Sampling Adequacy (M.S.A.)# and (3) Bartletts Test of Sphericity. 
Of the four matrices used for the study, two were comprised of eight 
variables and one each of fourteen and twenty. The sample sizes 
ranged from 50 to over 3,000. The results indicated that the three 
methods for overall assessment yeidlded comparable results. It was 
recommended, however, that methods for individual variables 
assessment also be used. (Author) 
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Background and Methods ; 

Recently attention has been focused upon the psychometric assessment of 
correlation matrices prior to the application of factor^analysis (Tobias 
and Carlson 1969, Cooley and Lohnes 1970, Kaiser 1971, Dziuban and Harris 
ly/z) . I he practice is often recommended (Tobias and Carlson lyby , 
Cooley and Lohnes 1971) but the results of such procedures are rather 
infrequently found in the literature. It Was the purpose of this study 
to empirically compare the results of such techniques v^hen applied to ' 
four well known correlation matrices. 

The initiul procedure involved the application of Bartlett's 
Test of Sphericity, Explanations of the test may be found In Kendall ' 
(1957), Anderson (1958), Cooley and Lohnes (1971). Maxwell (1959), and 
Tobias and Carlson (19^9) have recommended the test be used by 
psychologists prior to the application of factor analysis. It is 
computed by the formula: 

-j^(N-l)-l/6(2P+5j loge R 
where N is the sample size, P is the number of variables and R is 
the determinant of the correlation matrix. For large N the statistic 
Is approximately distributed as Chi Square with 1/2 P (P-1) degrees of 
freedom and has the associated hypothesis that the sample correlation 
matrix came from a multivariate normal population In which the variables 
of interest are Independent, 
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Knapp and Swoyer (I967) conducted a study of the power of 
the Bartlett test and found it to be quite substantial* They determined 
that for N=200, P=10 and^»»05 one would be virtually certain to reject 
the hypothesis stated above when the correlations were as low as 
This seems to suggest that the power of the test is highly sensitive to 
the sample size* 

The second procedure involved the i nspect ion of the off-diagonal 

elements of the anti-image covariance matrix S^R'^S : where fC is the 

inverse of the correlation matrix. The matrix S is defined as 

[diag R"^]"^ (Guttman 1953). Kaiser (1963) has summarized this 

practice as follows: 

"The preceding material suggests that G, the image covar lance 
matrix, might well be a good approximation to R-U^, the so called reduced 
correlation matrix'^ (actually the covariance matrix of the common parts 
of the tests). How can we tell if this approximation is good? >1ost 
simply by looking at the off-diagonal elements of the anti-image 
covariance matrix Q (or S^R'^S^) ... In this case if our N is essentially 
infinate, we have a comprehensive selection of tests from the universe 
of tests. If on the other hand, Q is not near-diagonal, we know that 
the approximation is poor. However, when this occurs, we have evidence 
that factor analysis is not appropriate for the data at hand. We may 
not have thoroughly covered the universe under consideration or that 
the factor analytic model may not even apply as N-^.'* 

The third procedure involved the computation of the Measure 

of Sampling Adequacy - M. S. A. (Kaiser 1970) for each matrix. The index 

is defined as: r-» r- > 



where the g^*s are the squares of the off-diagonal elements of the 
anti-image correlation matrix SR"^S and the r^*s are the squares of the 






original correlations. 



The index yields an assessment of whether the variables 
belong together psychometr ical ly and thus whether the matrix is 
appropriate for factor analysis* A similar measure may be defined for 




It gives an indication of whether a particular variable j ^'belongs to 
the family** psychometr i cal ly. Kaiser (1970) indicated that any M.S. A. 
lies between minus infinity and plus one. He further indicated that 
the index appears to be a function of four **main effects** and that 
holding the others constant it improves as: 

1. The number of variables increases. 

2. The (effective) number of factors decreases. 
3» The number of subjects increases. 

4. The general le^/el of correlation increases. 
Those three procedures were applied to four well known correlation 
matrices from the literature: 

1. The eight physical variables N=305 , (Harir \n 196?) 

2. The eight political variables N«1^7 , (Hat'^ian 196?) 

3- The Shaycroft Matrix N=3689 , (Dziuban and Harris, 1972) 
This matrix was based upon ten **Project Talent** variables 
of interest and four random deviates. 
The Armstrong and Soelberg Matrix N=50 (Armstrong and 
Soelberg, 1968) 

This matrix was entirely based upon correlations among 
20 variables which were rando.i normal deviates* 



Results : 

The Bartlett Test lead to a clear rejection of the hypothesis of interest 
for three of the four matrices — the physical variables |r|^ .58 x 10"^, 
the political variables |r|= .17 x 10"^ and the Shaycroft Matrix 
|r|= .83 X 10"3. The Armstrong Soelberg Matrix, however, exhibited an 
exact probability of .55, 1r/= .011. 

The anti-image covariance matrices S R S are pres ented in 
tables one through four. Elements which were not zero to the first 
decimal place were considered to be contributing to the non-diagonal i ty 
of the matrices. The physical variables produced eight off-diagonal 
elements of s2r-1s2 which were non zero (Table 1)^1^^. The political 
variables produced four such elements (7%) (Table ll) while the Shaycroft 
Matrix yielded ten non zero elements (Table 111)^5.5^. The Armstrong 
and Soelberg Matrix exhibited an anti-image covariance matrix with 128 
non zero off-di agonal elements (Table IV)«33^. 

The results of the application of the measure of Sampling 
Adequacy (M.S. A.) of the correlation matrices are presented in Table V, 
According to Kaiser *s present calibration: 

In the .90*s, Excel lent 

In the .80*s, Good 

In the .70*s, Fair 

In the .60' s. Poor ^ 

Below .60 , Terrible. 
The physical and political variables yielded values which would make 
them at least appropriate for factor analysis, .83 and .77. The 
Shaycroft Matrix yielded an overall M.S. A. of .91 which puts it in the 
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excellent range while the Armstrong and Soelberg Matrix yielded an 
M.S. A. value of -.38 which should be clear indication that the matrix 
should not be factor analyzed. 

Summary : 

We have applied three tests for psychometric adequacy to four well known 
correlat[orrmarr Tr um~the-l i terature. Those matrices represented 
somewhat disparate conditions under which factor analysis has been 
utilized. They ranged from sample size of 50 to over three thousand. 
The Armstrong and Soelberg Matrix was based upon the correlations of 
random numbers while the Shaycroft data were predominately composed of 
variables from '^Project Talent.** We have cited the results of the tests 
and have reached some speculative conclusions. 

Initially, the power of the Bartlett Test seems quite 
sensitive to the sample size. Knapp and Swoyer have provided evidence 
which indicates that this is true. We further suggest, however, that 
rejection of the hypothesis that the sample came from a population in 
which the variables of interest are independent may not be sufficient 
evioonre that the data'^at hand are appropriate for factor analysis. 
It seems possible that erroneous results might still be obtained by 
routinely using the Bartlett Test and then principal components - for 
instance, in the case of the Shaycroft Matrix. Four of the fourteen 
variables were random, yet were not detected or evidenced by the overall 
rejection of the hypothesis. 

The anti-image covariance matrix allows comparable 

decisions. The most nearly diagonal matrix was the Shaycroft which 
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was heavily dominated by the **TALENT** variables. The Armstrong and 

Soelberg Matrix yielded approximately one third of its elements as 

non-zero. Some appropriate decision rule seems necessary, however — 

that IS how diagonal is '^diagonal?" It seems to us that one gets a 

2-12 

feel for his data by examining the off-diagonal elements of S R S but 
that it becomes difficult to make a decision except in extreme cases* 

M. S, A. offer s advantages which the previous two procedures — 

lack. Kaiser has least provided some **gut leveV* decisions rules for the 
overall quality of the data. The second advantage of M,S»A, lies In 
the fact that it provides specific assessment of individual variables. 
For instance, the random variables were readily identified in the 
Shaycroft Matrix. Since it is usually assumed that one makes some 
apriori judgements about the variable:; which should be Incorporated 
into factor analytic investigation M.S. A. might be the logical intermediate 
step to assess the quality of those apriori judgements. Prior use of 
that procedure might increase the opportunity of realizing 'Mnterpretable** 
factors. 
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TABLE I 



AliTI IMAGE 00 VARIANCE M/ffRIX (S^R-ls^) PRTSZCAl VARIABLES 

Vaxld>le 
1 — 



2 


-.05 










3 


-.02 


-.10 








4 


-.10 


-.03 


-.04 






5 


-.04 


.04 


-.02 


-.01 




6 


-.03 


.00 


-.01 


.04 


-.16 


7 


.04 


-.02 


.03 


.04 


-.17 -.02 


8. 


.02 


-.07 


.02 


.01 


108 .08 -.08 



TABLE II 

ANTI IMAGE COVARIANCE MATRIX (S^PT^S^) POLITICAL VARIABLES ' 
Varl ebles 



1 














2 


-.11 












3 


.06 


-.07 










4 


-.03 


.01 


-.00 








5 


-.00 


.03 


.06 


-.06 






6 


.OA 


-.05 


-.02 


.03 


-.03 




7 


.02 


-.02 


.05 


-.05 


.19 


.01 


8 


.OA 


-.00 


.OA 


-.09 


.05 


.OA -.00 
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TABLE V 



m:A\-:S )V SAMPLING ADBQUACT (OVERALL AND INDIVIDUAL) 



PHYSICAL VARIABLES 
(OVERALL M.S.A. » .83) 



POLITICAL VARIABLES 
(OVERALL M.S.A. « .77) 



VARIABLE 

1 
2 
3 
4 
5 
6 
7 
8 



INDIVIDUAL M.S. A. 

.86 
.78 
.85 
.87 
.75 
.82 
.80 
.87 



VARIABLE 

I 
2 
3 
4 
5 
6 
7 
8 



INDIVIDUAL M.S. A. 

.62 
.68 
.81 
.85 
.12 
.92 
.72 
.83 



SHAyCKDFT MATRIX 
(OVEBALt M.S.A. - .91) 



ARMSTRONG AND SOELBERG MATRIX 
(OVERALL M.S. A. « -.38) 



VARIABLE 

1 

2 
3 
4 

"Talent" 5 
Variables 6 
7 
8 
9 

10 
IX 

Random 12 
Variables 13 
14 



INDIVIDUAL M.S. A. 


VARIABLE 


INDIVIDUAL 


.92 


1 


-.03 


.93 


2 


.04 


.87 


3 


-1.41 


.94 


4 


-2.11 


.89 


5 


-.30 


.93 


6 


-.34 


.93 


7 


-.22 


.87 


8 


-.28 


.92 


9 


-.78 


.86 


10 


-.38 


.68 


11 


-1.01 


.47 


12 


^.75 


,43 


13 


-.29 


-.83 


14 


-.27 




15 


-.14 




16 


.20 




17 


-.45 




18 


-.49 




19 


-.33 




20 


-.11 



